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Introduction 

Because of the new read program, a number of facilities are 
being added to the LISP system to po**mit manlpulatior of single 
characters and print names. Machine-language functions have been 
provided for breaking print names down into a list of their 
characters, for forming a list of characters into a print name, 
for creating a numerical object from a list of its characters, 
for reading in characters one by on* from an Input medium, and for 
testing characters to see whether taey are letters, numbers, 
operation characters, etc. A number of auxiliary objects and sub- 
routines are also described in thlf memo. 
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Characters and Character Ob jects 

Each of -the. 64 6-bit binary numbers corresponds to a PCD 
character, If we include illegal characters. Therefore, in order 
to manipulate these characters via LISP functions, each of them 
has a corresponding object* Of the 64 characters, 48 correspond 
to characters on the keypunch, and the print name of an object 
corresponding to a keyptinch character is simply that character. 
The print names of the remaining characters will be described later. 
When a LISP function is described which has a character as either 
value or argument, we really mean that it has an object corresponding 
to a character as value or argument respectively. 

The first group of legal characters are the letters of the 
alphabet from A bo Z. Each letter is a legitimate atomic symbol, 
and therefore may be referred to in a straightforward way, without 

ambiguity. 

The second group of legal characters are the digits from 
to 9. These must be handled with some care, because if a digit 
is considered as an ordinary integer rather than a character, a 
new non-unique object will be created corresponding to it, and 
this object will not be the same as the character object for the 
same digit, even though ic has the same print name. Since the 
character-handling programs depend on the character objects being 
in specific locations, this will lead to error. 

Both the current read program RDA and the new read program 
have been arranged so that digits through 9 read in as the 
corresponding character objects. Sfcese may be used in arithmetic 
just like any other number, tut even though the result of an 
arithmetic operation lies between and 9, it will not point to 
the corresponding character object- Thus character objects for 
through 9 may be obtained only by reading them or by manipulation 
of print names , 
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The third group of legal characters are the special characters 
These correspond to the remaining characters m the keypunch, such 
as 'V & nd *■**• Since these characters are tio% legitimate atomic 
symbols, there is a set of special character tfftlue objects which 
can be used to refer to them. 

A typical special character value object, say DOLLAR, has the 

following structure: 
I 
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Thus "DOLLAR* has value "$"• ' Note that "I" 1* not a legitimate 
atomic symbol as far as the read program RDS is concerned, so that 
there is no way of reading In ,qUQfEfi t $) «1# ; -Rift. 

The special character value objects and their permanent 
values are: 



DOILAR 

SLASH 

LPAr. 

R?AR 

COKKA 

PERIOD 

P1.VSS 

DASH 

S'i'AP. 

BLANK 

ESjSIGN 
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blank 



The following examples illustrate the us* of theju> objects 
and their rais*on d'etre. Each example cons ia*& of a triplet for 
the APPT.Y operator followed by the result. 
Examplee: 

BVAL {DOLLAR; .())*$ 

EVAL ((PRim ! PERIOD) 0) .*Pto& : #&- ^ auscs "*" to h& P** 1 * 1 ****' 
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The remaining characters are all illegal as far as the key- 
punch is concerned,. The two characters corresponding to 12/gj 

and 72 /o\ have been reserved for end-of -file and end-of -record 

respectively. The end-of -file character has print name $EOF$ and 
the end-of- record character has print name $SOR$; corresponding to 
these character objects are two character value objects EOR and 
EOF, whose values are $EOR$ and $EOF$ respectively. The rest of 
the illegal character objects have print names corresponding to 
their octal representations preceded by $IL and followed by $. 
For instance, the character 77 i$\ corresponds to a character object 
with pirnt name $2L77$* 

The character objects are arranged in the machine so that 
their first cells occupy successive storage locations. Thus it 
is possible to go from a character to the corresponding object 
or conversely by a single addition or subtraction. This speeds up 
characters-handling considerably, because it isn't necessary to 
search property lists of character objects for their print names: 
the names may be deduced from the object locations „ 
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Paoking and Unpacking Characters 

When a sequence of characters is to be made into either a 
print name or a numerical object, the characters must be put one 
by one into a buffer called BOPFO. BOPPO is used to store the 
characters until they are to be combined* It is not available 
explicitly to the LISP programmer, but the character-packing 
functions are described In terms of their effects on BOPPO. At 
any point, BOPPO contains a sequence of characters. Each operation 
on BOPPO either adds another character at the end of the sequence 
or clears BOPPO, i.e., sets BOPPO to the null sequence. The 
maximum length of the sequence la 120 characters; ah attempt to 
add mere characters will cause an error. 
The character-packing functions are: 

1. pack (cl: 

The argument of pack must be a character object. 
Pack adds the oharacter c at the end of the sequence of 
characters in BOFPO. The value of pack Is NIL. 

2. cJLearbuff J 1 : 

Clearbuff is a function of no arguments,, It clears 
BOPPO and has value NIL. The contents of BOFPO are un- 
defined until a clearbuff has been performed „ 

3. mknaro []: 

Mknam is a function of no arguments. Its value is 
a list of full words containing the characters in EOPPO in 
packed BCD form. The last word is filled out with the 
Illegal character code 77 if necessary . After mknam is 
performed, BOPPO is automatically cleared. Note that 
intern [mknanU) yields the object whose print name is 
in BOFPO. 
4» numob []: 

Numob is a function of no arguments. Its value is 
the numerical object represented by the sequence of 
characters in BOPPO. Numob use3 the subroutine NU'MBfc 
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to obtain the number from its BCD representation, and 
the preciae conventions on numbers will be given later 
when NOMBR is described. Numob will accept floating- 
point decimal numbers, decimal integers, fixed-point 
decimal numbers, and octal numbers* Octal numbers created 
by numob have a pointer to the object OCTAL on their 
property lists. They are considered as 36-bit rather than 
35-bit numbers, so that applying numob to a negative octal 
number leads to the creation of a single numerical object 
which contains a pointer to a full word with 1 in the sign 
bit, rather than leading to a pair whose first element 13 
MINUS, as is the case with negative decimal numbers. 
(Positive decimal integers from to 9 are converted so 
as to point to the corresponding character object . ) 

5, unpack Q): 

This function has as argument a pointer to a full 
word. Unpack considers the full word to be a set of 
6 BCD characters, and has as value a list of these 
characters ignoring all characters including and following 
the first 77 

6 InternCpniBffia] 

This function hm as argument % pointer to a PSAME 

typa atructur 8 © such as • 
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EXAKPk I Ism?? ! 



Its value is the atomic symbol having thi» print xmm^ 
If it does not already exist, than a new atomic symbol 
will be created* 
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The Char acter - Classifyi ng Function s 
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Liter has as argument* a 
value is T if the character if a 
and F otherwise* 

2. digit [c]: 

Digit lias as argument ^a^ 
value is T if the character Ji a 
and F otherwise. 

3. opchar CcJ: 

OpcJar has as argUJ»«at * & 
value is X if the chamo.feff : : ii ■■§'1 
F otherwi&e. Qpchaf / tyeafci-l 

4. dash [oji 

Dats h has as ;a^paQ6n$;:p 
is y if uhe^charactei* ia"l 
8-4 punch minus, and F o* 



object .Its 

la|i#r of the alphabet, 



object. Its 
between and 9, 



fcer object. Its 

fei*i>::-«p », and . 
^Ilii signs equivalent ly . 

object. Its value 
minus or m 
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^^ The Character - Reading Functions 

The character-reading functions make it possible to read 
characters one by one from data records* The choice of input 
medium is determined by sense switch 1, though at a future time 
mode-set facilities may be added so that input may be taken from 
any medium* This description will be given in terms of hollerith 
cards as input; the procedure for tape is completely analogous. 
However, the tape records must be 72 characters long. 

There is an object CIJRCHAR whpse val\jMB is the character most 
recently read (as an object)* There is also an object CHARCOUNT 
whose value is an integer object giving the column just read on 
the card, l*e*, the column number of the character given by 
CttXCHAR; There are three functions which affect the value of 
CVRCHAH: 

1 „ startread C } : 

Start read Is a function of no arguments which 

/ ^ > causes a new card to be read. The value of sta rtrea d 

is the first character on that card, or more precisely, 
the object correspondirig to the first character on the 
card. If an end-of-ff-le condition exists, the value of 
startread is $EOF$* The value of CORCHAR becomes the 
same as the output of startread , and the value of 
CHARCOUNT becomes 1, Both CORCHAR and CHARCOUNT are 
undefined until a starsread is performed* A startread 
may be performed before the current card has been . 
completely read* 
? B advance [ J : 

Advance is a function of no arguments which causes 
the next character to be read. The value of advance is 
that character • Af£er the ?8nd character on the card has 
been read, the next advance will have value $EOR$c After 
reading $EOR$, the next advance will act like a startread, 
ice*, will read the first character of the next card 

^■^ unless an end-of-flle condition exists.* The new value 
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of CURCHAR is the same as the output of advance; 
executing advance also increases tho value of CHARCOUNT 
by 1. However, CHARCOUNT is undefined when CURCHAR is 
either $EOR$ or $EOP$. 
endread CI: 

Endread is a function of no arguments which causes 
the remainder of the cf$^^'^- : ieti0^)!iA-. ignored. 
Endread sets CURCHAR to $E8fl$ and ^fifes CHARCQUWT un- 
defined; the value of gMtaM 1» '*t|if* 4E0R|. An advance 



following an gndread e^M : tim,M^M ^^miA . It CURCHAR 



already has value jwmfc :80 
will remain the same and 
value $EOR$. 




r Mi. performed, CURCHAR 

B ^i as uaoal/ nave 
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1 «, errorl [ } : 

Errorl is a function of no arguments and has value 
ML. It should be executed only while rending characters 
from a card (or tape). Its effect is to n/ike the character 
Just read, i.e., CURCHAR, so that when the snd of the card 
is reached*, either by successive advanoea or by an endread, 
the entire card is printed out along with a visual pointer 
$o the defective character. For a line consisting of 
ABCDEKJ followed by blanks, a pointer to C Wou<d look 
like this: 

V 
ABCDEFG 
A 

If errorl is performed ar< even number of tines on tie 
same character, the A will not appear. If errorl is 
performed prior to the first start read or while C^CHra 
Bb«S vatijme $E0R$ or $E0R$,' it will have no effect, Exeeating 
a start read before the current card has been completed dill 
cause the errorl printout to be lost. The card is cons' dered 
to have been completed Wien CURCHAR has been set to $E0h$, 
Successive endreads wil3 cause the errorl printout to be 
reprinted. Any number of characters in a given line n*iy 
be marked by errorl . 
2. numnam Cxi: 

The argument x of numnam is a pointer to a cell 
containing a number. E/aluafcing numnam will cause that 
number, considered as a signed decimal integer, to be 
printed without terminating the print line. The value of 
numnam 1b Nil,. 
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^ Subroutine NUMBR 

This subroutine will convert a sequence of characters in 
packed BCD form into the number which they represent. The input 
conventions are generally those of SAP-PAP decimal input; however, 
provision is made for handling octal numbers also. This subroutine 
should not be used as a I/ISP function because the conventions on 
input and output are different from normal LISP usage. The routine 
is a modification of UA JDBC1. 

The routine will accept floating-point decimal numbers, decimal 
integers, fixed-point decimal numbers, and octal numbers. For 
decimal numbers of the three different types mentioned, the con- 
ventions are those of the DEC pseudo-operation in PAP, However, 
the routine will convert only one number each time it is entered; 
it considers a decimal number to be terminated when it encounters 
any character other than a digit, ".", V", "-", "E M , or "BV 
"q" must not appear immediately after a decimal integer. 

Octal numbers are followed by a Q„ and may be preceded by a 
^ sign. The U Q" may be followed by an unsigned scale factor n, 

which will cause the number to be shifted n octal places to the 
left, (n is Itself a decimal number.) The number is right- 
3 ratified, and overflows on the left are ignored. The termination 
rules are the same as for decimal numbers, although a second "Q" 
will not act as a terminator. 

Some examples of octal conversion are: 

Input Result 

-75Q % OQOGO 00075 

+63Q4 00000 6 30000 

77Q10 7 70000 00000 

-3<W- 4 0000C 30000 

Note that in both octal and decimal numbers, the two 
different minus signs are equivalent; and if the initial character 
is a terminator, the result is a dectsial fixed-point zero,, 
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r^ The input sequence of characters must be packed into successive 

N —- words in storage, 6 characters per word. However, the first sig- 
nificant character need not be the first charaoter of the word in 
which it occurs. 

In order to tell whether the input number is octal or decimal, 
the program first scans the characters in sequence until it en- 
counters a character other than a digit less than 8. If the first 
such character is Q, then the number is taken as octal; otherwise 
it is taken as decimal. This rule accounts in part for the peculiar 
behavior of "Q" as a terminator. 

The calling sequence is as follows: 

Let L be the location of the first packed word, and n the location 
of the first significant character within the word, counting from 
the left, with 1 < n «£6. Then the following word should be placed 
in the AC: 

PZE L,,n 
^-^ The subroutine is entered by the instruction 

TSX $NUMBR,4 

The resulting number appears in the W3. The AC will contain L and 
n for the character following the terminating character, so that 
for reading numbers separated by commas, say, the subroutine may 
be entered several times in succession, and the AC need not be 
reloaded each time. The sign of the AC will be • for floating-point 
decimal numbers and + for all other numbers; however, the sign of 
the AC is Ignored when the routine is entered. 
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